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Abstract. The problem of fc-minimisation for a DFA M is the computa- 
tion of a smallest DFA N (where the size \M\ of a DFA M is the size of the 
domain of the transition function) such that L{M) A L{N) C E'^'', which 
means that their recognized languages differ only on words of length less 
than k. The previously best algorithm, which runs in time 0(|M|log2 n) 
where n is the number of states, is extended to DFAs with partial transi- 
tion functions. Moreover, a faster C9(|M| logn) algorithm for DFAs that 
recognise finite languages is presented. In comparison to the previous al- 
gorithm for total DFAs, the new algorithm is much simpler and allows 
the calculation of a fc-minimal DFA for each k in parallel. Secondly, it 
is demonstrated that calculating the least number of introduced errors 
is hard: Given a DFA M and numbers k and m, it is NP-hard to de- 
cide whether there exists a fc-minimal DFA N with \L{M) A L{N)\ < m. 
A similar result holds for hyper-minimisation of DFAs in general: Given 
a DFA M and numbers s and m, it is NP-hard to decide whether there 
exists a DFA N with at most s states such that \L{M) A L{N)\ < m. 
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1 Introduction 

Deterministic finite automata (DFAs) are one of the simplest devices recognising 
languages. The study of their properties is motivated by (i) their simplicity, 
which yields efficient operations, (ii) their wide-spread applications, (iii) their 
connections to various other areas in theoretical computer science, and (iv) the 
apparent beauty of their theory. A DFA M is a quintuple (Q, S, 5, qo, F), where 
Q is its finite state-set, U is its finite alphabet, 5: Q x — > Q is its partial 
transition function, qo ^ Q is its starting state, and F C Q is its set of accepting 
states. The DFA M is total if S is total. The transition function 6 is extended 
to S: Q X S* Q in the standard way. The language L(M) that is recognised 
by the DFA M is L{M) = {w \ S{qo, w) e F}. 
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Two DFAs M and N arc equivalent (written as M ee N) if L{M) = L{N). 
A DFA M is minimal if all equivalent DFAs are larger. One of the classical 
DFA problems is the minimisation problem, which given a DFA A/ asks for the 
(unique) minimal equivalent DFA. The asymptotically fastest DFA minimisa- 
tion algorithm runs in time 0(1^1 nlogn) and is due to Hopcroft |9l7j . where 
n = \Q\; its variant for partial DFAs is known to run in time OdAfj logn). 

Recently, minimisation was also considered for hyper-equivalence [213] , which 
allows a finite difference in the languages. Two languages L and L' are hyper- 
equivalent if \L A L'\ < oo, where A denotes the symmetric difference of two sets. 
The DFAs M and N are hyper-equivalent if their recognised languages are. The 
DFA M is hyper-minimal if all hyper- equivalent DFAs are larger. The algorithms 
for hyper-minimisation |3l2j were gradually improved over time to the currently 
best run-time OdMllog^n) j8l6| . which can be reduced to OdMllogrt) using 
a strong computational model (with randomisation or special memory access). 
Since classical DFA minimisation linearly reduces to hyper- minimisation j8j, an 
algorithm that is faster than OdMllogn) seems unlikely. Moreover, according 
to the authors' knowledge, randomisation does not help Hopcroft's [S] or any 
other DFA minimisation algorithm. Thus, the randomised hyper-minimisation 
algorithm also seems to be hard to improve. 

Already [3] introduces a stricter notion of hyper-equivalence. Two languages 
L and L' are k- similar if they only differ on words of length less than k. Anal- 
ogously, DFAs are /c-similar if their recognised languages are. A DFA M is 
k-minimal if all fc-similar DFAs are larger, and the k- minimisation problem asks 
for a fc-minimal DFA that is fc-similar to the given DFA M . The known algo- 
rithm 1^ for fc-minimisation of total DFAs runs in time OdAfj log^ n), however 
it is quite complicated and fails for non-total DFAs. 

In this contribution, we present a simpler fc-minimisation algorithm for gen- 
eral DFAs, which still runs in time OdMI log^n). This represents a significant 
improvement compared to the complexity for the corresponding total DFA if 
the transition table of M is sparse. Its running time can be reduced if we al- 
low a stronger computational model. In addition, the new algorithm runs in 
time OdMl log n) for every DFA M that recognises a finite language. Finally, 
the new algorithm can calculate (a compact representation of) a fc-minimal DFA 
for each possible fc in a single run (in the aforementioned run-time) . Outputting 
all the resulting DFAs might take time Q{n\M\ log^ n). 

Although fc-minimisation can be efficiently performed, no uniform bound on 
the number of introduced errors is provided. In the case of hyper-minimisation, 
it is known jlOj that the optimal (i.e., the DFA committing the least number 
of errors) hyper-minimal DFA and the number of its errors m can be efficiently 
computed. However, this approach does not generalise to fc-minimisation. We 
show that this is for a reason: already the problem of calculating the number m 
of errors of an optimal fc-minimal automaton is NP-hard. 

Finally, for some applications it would be beneficial if we could balance the 
number m of errors against the compression rate Thus, we also consider the 
question whether given a DFA M and two integers s and m there is a DFA TV 
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with at most s states that commits at most m errors (i.e., \L{M) A L{N)\ < m). 
Unfortunately, we show that this problem is also NP-hard. 




2 Preliminaries 

We usually use the two DFAs M = {Q,S,6,qo,F) and N ^ {P, S, ^i,po,F'). 
We also write S{w) for 6{qo,'w). The right-language Lf,[{q) of a state q G Q is 
the language LM{q) = {w \ S{q,w) G F} recognised by M starting in state q. 
Minimisation of DFAs is based on calculating the equivalence = between states, 
which is defined by g = p if and only if £a/(<z) = Lpf{p). Similarly, the left 
language of q is the language d^^{q) = {w \ 6{w) = q} of words leading to q 
in M. 

For two languages L and L' , we define their distance d{L, L') as 

d{L, L') = min {£ \ L D S-^ = L' D Z"-^} , 

where min0 = oo. Actually, d is an ultrametric. The distance d can be extended 
to states: d{q,p) = d{LM{q), Ln{p)) for g G Q and p E P. It satisfies the simple 
recursive formula: 

max {d{S{q, a), ii{p, a)) \ a G S} otherwise. 

Since d is an ultrametric on languages, ([T]) yields that the distance d(qi,q2) be- 
tween 5i, 52 G Q in the DFA M is either infinite or small. Formally, d{qi, 52) = ck) 
or d{qi,q2) < \Q\. 

The minimal DFAs considered in this paper are obtained mostly by state 
merging. We say that the DFA TV is the result of merging state q to state p 
(assuming q p) in M ii N is obtained from M by changing all transitions 
ending in q to transitions ending in p and deleting the state q. If q was the starting 
state, then p is the new starting state. Formally, P = Q \ {q}, F' = F \ {q}, and 

jp if5{r,a) = q jp if go = 9 

I 5{r, a) otherwise, I qo otherwise. 

The process is illustrated in Fig. [TJ 

Finally, let in-level M{q) be the length of the longest word leading to q 
in M. If there is no such longest word, then in-levelM^q) = 00. Formally, 
in-levelM{q) = sup{|w| | w G S~^{q)} for every q E Q. The structural char- 
acterisation of hyper-minimal DFAs O Sect. 3.2] relies on a state classification 
into kernel and preamble states. The set Ker(M) of kernel states consists of all 
states q G Q with in-levelM{q) = 00, whereas the remaining states are preamble 
states. Roughly speaking, the kernels of two hyper-equivalent and hyper-minimal 
automata are isomorphic in the standard sense, and their preambles are also iso- 
morphic except for acceptance values. 
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Fig. 1. Merging state G into /. 

3 Efficient fc-minimisation 

3.1 fe-similarity and fc-minimisation 

Two languages L and L' are k-similar if they only differ on words of length 
smaller than fc, and the two DFAs M and N are /c-similar if their recognised 
languages are. The DFA M is k-minimal if all fc-similar DFAs are larger. In 
this section, we first give a general simple algorithm fc-MiNiMiSE that computes 
a /c-minimal DFA that is fc-similar to the input DFA M . Then we present a data 
structure that allows a fast, yet simple implementation of this algorithm. 

Definition 1. For two languages L and L' , we let L L' <=^ d{L,L') < k. 

The hyper-equivalence relation [3| can be now defined as ^ — IJfc ^fe- Next, 
we extend fc-similarity to states. 

Definition 2. Two states q £ Q and p G P are k-similar, denoted by q P, if 

d{q,p) + miT\{k^ in-level M{q) Tin-level isi{p)) <k . 

While is an equivalence relation on languages, it is, in general, only a 
compatibility relation (i.e., reflexive and symmetric) on states. On states the 
hyper-equivalence is not a direct generalisation of fc-similarity. Instead, p ^ q ii 
and only if LM{q) ^ L]\j{p). We use the fc-similarity relation to give a simple al- 
gorithm fc-MiNiMiSE(M), which constructs a fc--minimal DFA (see Algorithm [1]) . 
In Section [3.21 we show how to implement it efficiently 

Theorem 1. fc-MiNiMiSE returns a k-minimal DFA that is k-similar to M. 

3.2 Distance forests 

In this section we define distance forests, which capture the information of the 
distance between states of a given minimal DFA M. We show that fc- minimisation 
can be performed in linear time, when a distance forest for M is supplied. We 
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Algorithm 1 fc-MiNiMiSE(A/) with minimal AI 

1: calculate on Q 

2: N ^ M 

3: while q ~fe p for some q,p £ P and q ^ p do 
4: if in-level M{q) > in-level m{p) then 
5: swap q and p 

6: N ^ MERGE{N,q,p) 



start with a total DFA Af because in this case the construction is fairly easy. In 
Section [331 we show how to extend the construction to non-total DFAs. 

Let be a forest (i.e., set of trees) whose leaves are enumerated by Q and 
whose edges are weighted by elements of N. For convenience, we identify the leaf 
vertices with their label. For every g £ Q, we let tiee{q) e J-" be the (unique) tree 
that contains q. The level level(w) of a vertex w in i e 7^ is the maximal weight 
of all paths from v to a leaf, where the weights are added along a path. Finally, 
given two vertices vi , V2 of the same tree t £ J-', the lowest common ancestor of 
vi and V2 is the vertex lca(i;i, V2). 

Definition 3 (Distance forest). Let T he a forest whose leaves are enumerated 
by Q. Then J- is a distance forest for M if for every q,p Q we have 

d{q p) = |l'^'^°Klca(g,p)) j/trec(g) = trec(p), 
1 oo otherwise. 




In order to construct a distance forest we use ([T]) to calculate the distance. 
Mind that M is minimal, so there are no states with distance 0. In phase £, we 
merge all states at distance exactly £ into one state. Since we merged all states of 
distance at most i — I in the previous phases, we only need to identify the states 
of distance 1 in the merged DFA. Thus we simply group the states according 
to their vectors of transitions by letters from E = {ai, . . . , To this end we 
store these vectors in a dictionary, organised as a trie of depth m. The leaf of 
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a trie corresponding to a path (gi, . . . , keeps a list of all states q such that 
6{q,ai) ~ Qi for every 1 < i < m. For each node v in the trie we keep a linear 
dictionary that maps a state q into a child of v. We demand that this linear 
dictionary supports search, insertion, deletion, and enumeration of all elements. 

Theorem 2. Given a total DFA M, we can build a distance forest for M using 
0{\M\logn) linear- dictionary operations. 

We now shortly discuss some possible implementations of the linear dictio- 
nary. An implementation using balanced trees would have linear space consump- 
tion and the essential operations would run in time O{logn). If we allow ran- 
domisation, then we can use dynamic hashing. It has a worst-case constant time 
look-up and an amortised expected constant time for updates jllj . Since it is 
natural to assume that log n is proportional to the size of a machine word, we 
can hash in constant time. We can obtain even better time bounds by turning 
to more powerful models. In the RAM model, we can use exponential search 
trees [1], whose time per operation is 0{ iog^o°^"g„ ) in linear space. Finally, if 
we allow a quadratic space consumption, which is still possible in sub-quadratic 
time, then we can allocate (but not initialise) a table of size |M| x n. Stan- 
dard methods can be used to keep track of the actually used table entries, so 
that we obtain a constant run-time for each operation, but at the expense of 
©(iMjn) space; i.e., quadratic memory consumption. 

We can now use a distance forest to efficiently implement /c-Minimise. For 
each state q we locate its highest ancestor Vq with level(wg) < k — in-level{q) . 
Then q can be merged into any state that occurs in the subtree rooted in Vq 
(assuming it has a smaller in-level). This can be done using a depth- first traversal 
on the trees of the distance forest. A more elaborate construction based on this 
approach yields the following. 

Theorem 3. Given a distance forest for M , we can compute the size of a 
k-minimal DFA that is k-similar to M for all k in time 0{\M\). For a fixed k, 
we can also compute a k-minimal DFA in time 0{\M\). Finally, we can run 
the algorithm in time 0{\M\ logn) such that it has a k-minimal DFA stored in 
memory in its k-th phase. 

3.3 Finite languages and partial transition functions 

The construction of a distance forest was based on a total transition function 5, 
and the run-time was bounded by the size of 5. We now show a modification for 
the non-total case. The main obstacle is the construction of a distance forest for 
an acyclic DFA. The remaining changes are relatively straightforward. 

Theorem 4. For every acyclic DFA M we can build a distance forest in time 
0{\M\\ogn). 

Proof (sketch). Since L{M) is finite, we have that m(p) = max {|it;| | ui G Lm{p)} 
is a natural number for every state p. Let Qi = {p\ m{p) = i} and Q<oo =[JiQi- 
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Every state has a finite right-language, and thus every distance forest consists of 
a single tree. We iteratively construct the fragments of this tree by starting from 
a single leaf _L, which represents the empty language and "undefinedness" of the 
transition function. Before we start to process Qt, we have already constructed 
the distance tree for 1J,<( Qi - The constructed fragments are connected to a single 
path, called the spine, which ends at the leaf _L (see Fig.[3|). 

Let Qt = {pi, . . . ,ps}, and let v £ Qt- Moreover, let f(w) be the vector of 
states V = {S{v,a))a(ES, where the coordinates are sorted by a fixed order on U. 
Define the distance between those vectors as 



where we know that d{pi, _L) = m{pi) and d{L,p[) = m{p^). Similarly to the dis- 
tance, we can define the father f(v) of a vector v = {pa)a£S as f (v) = {i{pa))a&s- 
Then 



We can now use a divide-and-conquer approach: First, for each vector we cal- 
culate its 2'""-th ancestor, where k = [logs/2]. Then all such vectors are sorted 
according to their ancestors, in particular they are partitioned into blocks with 
the same ancestors. After that we recurse onto those (bottom) blocks that have 
more than two entries and onto the upper block, which consists of the different 
2'^-ancestors. The recursion ends for blocks containing at most two vectors, for 
which we calculate the distance tree directly. □ 



For every state q € Q, its signature sig{q) is {a \ LM{S{q,a)) is infinite}. 
If sig(g) 7^ sig(p), then d{q,p) = oo, which allows us to keep a separate dic- 
tionary for each signature. Let us fix such a trie. To take into account also 
the transitions by letters outside the signature, we introduce a fresh letter $, 
whose transitions are represented in the trie as well. We organize them such that 
in phase £ the $-transitions for the states q and p are the same if and only if 
max{d((5(q, a),6{p,a))\a ^ sig{q)} < £ — 1. This is easily organised if the distance 
forest for all states with a finite right-language is supplied. 



d{{pa)aes, ip'a)aes) = max{d(K,p-) + 1 \ a £ S} 





Fig. 3. Illustration for the construction 
of the distance tree. The spine is de- 
picted using with a thicker line. Split- 
ting one fragment into smaller recursive 
calls is shown. 
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Theorem 5. Given a (non-total) DFA Al we can build a distance forest for it 
using 0{\M\\ogn) linear- dictionary operations. 

4 Hyper-equivalence and hyper-minimisation 

When considering minimisation with errors, it is natural that one would like 
to impose a bound on the total number of errors introduced by minimisation. 
In this section, we investigate whether given m, s G N and a DFA M we can 
construct a DFA N such that: 

(i) N is hypcr-equivalcnt to M; i.e., N ^ M, 

(ii) N has at most s states, and 

(iii) N commits at most m errors compared to M; i.e., \L{N) A L{M)\ < m. 

Let us call the general problem 'error-bounded hyper- minimisation'. We show 
that this problem is intractable (NP-hard) . Only having a bound on the number 
of errors allows us to return the original DFA, which commits no errors. 

To show NP-hardness of the problem we reduce the 3-colouring problem to 
it. Roughly speaking, we construct the DFA M from a graph G = (V, E) as 
follows. Each vertex v G V is represented by a state v £ Q, and each edge e G E 
is represented by a symbol e G S. We introduce additional states in a way such 
that their isomorphic copies are present in any minimal DFA that is hyper- 
equivalent to M. The additional states are needed to ensure that for every edge 
e = {vi,V2} € E the languages La/((5(ui, e)) and Xa/ ('5(w2, e)) differ. Now we 
assume that m= |£'|-(|y|— 2) and s ~ lA. We construct the DFA M such 
that all vertices of C Q are hyper-equivalent to each other and none is hyper- 
equivalent to any other state. We can save |y| — 3 states by merging all states 
of V into at most 3 states. These merges will cause at least \E\ ■ (\V\ — 2) errors. 
Additionally, 3 states will become superfluous after the merges, so that we can 
save \V\ states. There are two cases: 

- If the input graph G is 3-colourable by c: V — t- [3], then we can merge all 
states of c~^(i) into a single state for every i S [3]. Since c is proper, we 
never merge states wi, W2 S Q with {wi, W2} G iJ, which avoids further errors. 

- On the other hand, if G is not 3-colourable, then we merge at least two 
states vi,V2 & Q such that e = {wi,W2} G E. This merge additionally intro- 
duces 2 errors caused by the difference L{5(vi,e)) A L{5{v2,e))- 

Consequently, a DFA that (i) is hyper-equivalent to A/, (ii) has at most s states, 
and (iii) commits at most m errors exists if and only if G is 3-colourablc. This 
shows that error-bounded hyper-minimisation is NP-hard. 

Definition 4. We construct a DFA M ~ (Q, S, S, T, F) as follows: 

- Q = {T,±,oo,©,©}u V"u{Oj I o e {©,©,©},j e [3]}, 

- S = {a,b}LlVUE, 

- F = {oo,©}, 
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Fig. 4. DFA M constructed in Section [J] where a-transitions are represented by un- 
broken hues (unless noted otherwise), 6-transitions by dashed lines, and e = {ui,«„} 
and e' = {v2,vz} with vi < V2 < v-i < Vn- The hyper-equivalence ~ is indicated. 



— for every v G V , e — {vi,V2} G E with v ^ e. and Vi < V2, O G {©, ©} 





= V 


S{oo, a) 


= ©1 


5(00,6) = ©1 




S{v, e) 


= ©1 


S{vi,e) 


= ©1 


S{v2,e) = ©1 




5(©i,a) 


-©2 


<5(©2,a) 


= ©3 


<5(©3,a) = © 


(5(©3,6) = © 


^(Oi,a) 


= O2 


(5(02,0) 


- O3 


(5(03,a) = 


5(03,&) = S{0,b) 



00 



— For all remaining cases, we set S{q, a) 

Consequently, the DFA M has 14-1- \ V\ states (see Figure U]). Next, we show 
how to collapse hyper-equivalent states using a proper 3-colouring c : — !• [3] to 
obtain only 14 states. 

Definition 5. Let c: F — > [3] be a proper 3-colouring for G. We construct the 
DFA c{M) = (P, S, fi, T, F) where 

— P = {T,±,oo,©,©}U[3]U{0, IOG{©,©},je [3]}, 

— fi{p, a) = S(p, a) for all p € P \ {T, 1, 2, 3} and a G S, and 

— for every v gV, i G [3], and e — {vi,V2} G E with vi < V2 



/i(T, v) = c{v) 



©1 , j/c(w2) ^ i 
©1 , otherwise. 



Lemma 1. There exists a DFA that has at most 14 states and commits at most 
\E\ ■ (\V\ — 2) errors when compared to M if and only if G is 3-colourable. 

Corollary 1. 'Error-bounded hyper-minimisation' is NP-complete. More for- 
mally, given a DFA M and two integers m,s G poly(|M|), it is NP-complete to 
decide whether there is a DFA N with at most s states and \L{M) A L(N)\ < m. 
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5 Error-bounded ^-minimisation 

In Section [3] the number of errors between M and the constructed fc-minimal 
DFA was not calculated. In general, there is no unique fc-minimal DFA for M 
and the various fc-minimal DFAs for M can differ in the number of errors that 
they commit relative to M . Since several dependent merges arc performed in the 
course of fc-minimisation, the number of errors between the original DFA M and 
the resulting fc-minimal DFA is not necessarily the sum of the errors introduced 
for each merging step. This is due to the fact that errors made in one merge 
might be cancelled out in a subsequent merge. It is natural to ask, whether 
it is nevertheless possible to efficiently construct an optimal k-minimal DFA 
for M (i.e., a fc-minimal DFA with the least number of errors introduced). In the 
following we show that the construction of an optimal fc-minimal DFA for M is 
intractable (NP-hard). 

The intractability is shown by a reduction from the 3-colouring problem for 
a graph G = (V, E) in a similar, though much more refined, way as in Sec- 
tion m We again construct a DFA M with one state v for every vertex v G V 
and one letter e for each edge e G E. We introduce three additional states 
{lo,2o,3o} (besides others) to represent the 3 colours. For the following dis- 
cussion, let = {P, S, ^,po, F') be a fc--minimal DFA for AI. Let us fix an 
edge e ~ {vi,V2} G E. The DFA M is constructed such that the languages 
LM(^(wi,e)) and Lm{S{v2, e)) have a large but finite symmetric difference; as 
in the previous section, if a proper 3-colouring c: V ^ [3] exists the DFA N 
can be obtained by merging each state v into c(u)o- In addition, for every edge 
e = {vi,V2} € E and vertex u G e, we let /i(c(w)o,e) = S{v,e). On the other 
hand, if G admits no proper 3-colouring, then the DFA N is still obtained by 
state merges performed on M. However, because G has no proper 3-colouring, 
in the constructed DFA M there exist 2 states vi, V2 such that e = {vi,V2} G E 
and that both vi and V2 are merged into the same state p E P. Then the tran- 
sition /i(p, e) cannot match both S{vi, e) and S{v2, e). In order to make such an 
error costly, the left languages of v and v' arc designed to be large, but finite. In 
contrast, we can easily change the transitions of states {lo,2o,3o} by letters e 
because the left- languages of the states {Iq, 2o, 3o} are small. 

To keep the presentation simple, we will use two gadgets. The first one will 
enable us to make sure that two states cannot be merged: fc-similar states are 
also hyper-equivalent, so we can simply avoid undesired merges by making states 
hyper-inequivalent. Another gadget will be used to increase the in- level of certain 
states to a desired value. 

Lemma 2. For every congruence c^i C Q x Q on M , there exists a DFA N such 
that (i) Pi P2 for every pi £ P \ Q and P2 G P with pi ^ P2, and (H) qi q2 
in N for all qi ^ q2- 

In graphical illustrations, we use different shapes for qi and q2 to indicate 
that qi 7^ q2, because of the gadget of Lemma [2] Note that states with the same 
shape need not be fc-similar. 
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Lemma 3. For every subset S ^ Q \ {qo} of states and map min-lcvcl: S* — > N, 
there exists a DFA N = {Q U I, S U A, ^, qq, F) such that ~ 1 for every 

i £ I and in-levelM{s) > min-lcvel(.s) for every s £ S. 

We will indicate the level i below the state name in graphical illustrations. 
Moreover, we add a special feathered arrow to the state q, whenever the gadget 
is used for the state q to increase its level. 

Next, let us present the formal construction. Let G = {V, E) be an undirected 
graph. Select fc, s £ N such that s > log(|y|) + 2 and k > 4s. Moreover, let 
e^k~2s. 



3o 




Fig. 5. Illustration of the DFA A-I of Section [5] 
Definition 6. We construct the DFA M = (Q, 6, 0, F) as follows: 

- Q = {±,©,©,3o}U{i, \ ie [2],je [£]}uyu[0,s]U{©, \ Q<i<s}, 

- S = {a,b}UVUE, 

- F = {©,,, li}, and 

- for every v £ V , e = {vi, V2} (z E with v ^ e and vi < V2, i £ [s], and j G [i] 



6{i — 1, a) = i 


S{vi,e) = ©0 


S{lo 


e) = e 


S{l,-ub)^l, 


5{i -l,b)^i 


S{v2,e) = © 


Si2o 


e) = © 


(5(2,_i,6) = 2j 


5(©,_i,a) = ©, 


<5(i;,e) = e 


^(3o 


e) = © 


(5(1,, &) = ©, 


(5(©,_i,6) = ©, 


S{v, a) = li 






(5(2,, 6) = ©, 


5{9,a) = ©1 


(5(s, v) = V 









— For all remaining cases, we set 6{q,<T) ~ 
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Finally, we show how to collapse A:-similar states using a proper 3-eolouring 
c: 1^ -> [3]. We obtain the fc-similar DFA c{M) ^ {P,U,n,0,F) from M by 
merging each state v into c{v)o- In addition, for every edge e — {ui,W2} G 
we let /i(c(t;i)o, e) = S{vi,e) and /i(c(w2)o,e) = S{v2,e). Since the colouring c is 
proper, we have that c(wi) 7^ c('(;2), which yields that fi is well-defined. For the 
remaining i g [3] \ {c(wi), c(w2)}, we let /i(io, e) = ©o- AH equivalent states (i.e., 
± and ©) are merged. The gadgets that were added to M survive and are added 
to c(M). Naturally, if a certain state does no longer exist, then all transitions 
leading to or originating from it are deleted too. This applies for example to ®. 

Lemma 4. There exists a k-minimal DFA N for M with at most 

2^'^^ ■ \E\ ■ {\V\ - 2) + 3 • 2'*-^ • \E\ + 2"+^ • \V\ 

errors if and only if the input graph G is i- colourable. 

Corollary 2. 'Error-bounded k- minimisation' is NP-complete. 
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A Proofs and additional material for Section [2] 

Lemma 5. IJp,q € Q then d{p,q) < +00 implies that d{p,q) < n. 

Proof. Let Di denote the equivalence relation defined as Di{p,q) iff d{p,q) < i. 
Let Ui be the number of equivalence classes of D^, for i = 0, 1, . . . . Note that if 
Ui = rii^i then n,; = nj for all j > i and d{q, q') > i implies d{q, q') = +00. 

Since no < \Q\ the sequence hq > ni > ... stabilises at position n|Q|_i, 
i.e., there are no states q,q' such that D\Q\{q,q') and -iD\Q\_i{q, q'). Hence 
d{q,q') < +CXD implies D\Q\_i{q,q'), i.e., d{q,q') < n. □ 

B Proofs and additional material for Section [3] 

B.l Proofs and additional material for Section 13.11 

It can be shown that if M N then the states reached after reading the same 
word arc also fc-similar, assuming that the word is short enough. 

Lemma 6. Let M N, (71,(72 G Q, and wi,W2 G S* he such that 5{wi) — qi 
and \wi\ = in-levehiiqi) for i £ [2]. If qi ^k qi , then fJ-iwi) ^k [i{w-i). 

Proof. First, suppose that q\ 7^ 92. Then, M ^ N yields that 

H{wi) S{wi) = (?1 7^ (?2 = S{W2) ^ tJ-{w2) 

and thus p{wi) 7^ p{w2), which proves that T^fc p{w2). 

Second, let d{qi,q2) < 00. Since qi q2, we have 

92) + min(fc, liuil, |u;2|) > ■ 

Clearly, there exists u e L]\j{qi) A LM{q2) with \u\ > d{qi,q2) — 1. Moreover, 
\wiu\ > k < \w2u\. Since M ^k we have WiU,W2U ^ L{M) AL{N). Conse- 
quently 

u ^ LM{qi) ALn{h{wi)) and u <^ LM{q2) A Ln{ij.{w2)) . 

By assumption, u € L m {qi) A L m iq2) and thus u € Ln{p{wi)) A L]s[{n{w2)), 
which shows that d{p('Wi), fi{w2)) > \u\ + 1. Clearly, in-level j^{fi{wi)) > 
and in-level isi{p{w 2)) > |''^2|i which yields p{wi) 7^^ /^(u'2). □ 

We show some properties of fc-MiNiMiSE, which arc used to show that it 
properly constructs a fc-minimal DFA. Let N denote the DFA constructed by 
/c-MlNlMiSE at any particular point. 

Lemma 7. If5]^{p',w) ~ p and in-levelM^p) < k then 

\w\ < in-level]\{{p) — in-levelM(j>')- (2) 
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Proof. The assertion of the lemma is shown to be hold after each merge done by 
fc-MiNiMiSE, i.e., by the induction on the number of merges done by fc-MiNiMiSE. 
If there were no merges done yet then N = M and the claim holds true. Let N 
denote the DFA before the merge and N' after it. 

We focus on w = a E S ■ So assume that (5Ar/(pi,a) = p2 after merging 
state p to q. The only non-trivial case is when 6N{pi,a) ~ P and p2 = q, 
i.e., when something is changed after the merging. By induction assumption 
in-level M^J)) — in-level MiPi) > 1. As in-level M{q) > in-level m{p) as guaran- 
teed by fc-MiNiMiSE, the claim is obtained. 

When |w| > 1 it is enough to consider the states obtained after transitions 
after each letter of w and sum up the inequalities. □ 

Lemma 8. During the run of fc-MlNlMlSE for all p' £ Q{N), 



Proof. We establish this claim by induction. Let N' denote the DFA after merg- 
ing p to q and N just before this merge. Note, that a.s p ^ q (in M), thus 
d[p,q) > (in AI). Thus p q implies min(k,in-levelM{p),'i'n-levelMiq)) < k. 
Since p is merged to q by fc-MiNiMiSE, in-level m{p) < in-level M{q) and as p 
q also in-levelM{p) < k. Then by Lemma [2] we conclude that there is no word 
leading from g to p in N: assume for the sake of contradiction that there is such 
a word w. Since in-levelmip) < k, by Lemma [5] 



contradiction. Thus there is now word leading from q to p in N and therefore 
LAr(g) = LN'iq). 

For the other case, lest us first estimate d{L]y{p),LN'{q)) = d{LN{p), L]y{q)). 
As already noted, mhi(k,in-levelM{p)iin-levelM{q)) = in-levelM{p), which al- 
lows us to reduce p ^k q to 



d{LM{p'),L]^{p')) < ma.x{0,k ~ in-levehiip'))- 



(3) 



in-level m(p) > in-level M(q) + \w\> in-level M{q) 



d{LM{p),LM{q)) +in-levelM{q) < k 



and thus 



d{LM{p),LMiq)) <k- in-levelnip)- 



By induction assumption 



d{L M (q) , L N (q)) < max(0, k — in-level M^q)) 
d{LM{p), Ln{p)) < max(0, k — in-levelM{q)) 
= k — in-level m(ji) 



and as d is an ultra metric 



d{LN{p),LN{q)) <k - in-levehiip) ■ 



(4) 
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So consider an arbitrary state p'. If it has no word leading to p m N, then 
L N {p') ~ Lj^i [p') and we arc done. If it has a word w leading to p, then Lemma[7] 
can be applied, establishing: 

d{LN{p'),LN'{p')) = max \w\ + d{LN'{p), LN{q)) 

the former can be estimated by ^ and the latter by yielding 

d{LN{p'), Ln'{p')) < {in-level m{p) — in-levelM(p)) + {k — in-levelM{p)) 
< fc — in-levelM{p'), 

which ends the proof. □ 

Proof (of TheoremUl) . Let qo,qi, . . . ,qn be the starting states in DFAs M = Nq, 
Ni, Nn = N. By LemmalU d{LMiqi), Lpf^{qi)) < k. On the other hand, 
since qi is merged to gi+i then d{LM{qi), LM{qi+i)) < k. So all the languages in 
question are within distance k of each other and therefore 

d{L{M),L{N)) < k . 

Thus M^kN. It is left to show that N is fc-minimal. Consider the set of states 
Q' of N and let M' be a DFA fc-similar to M. By fc-MiNiMiSE, they are pairwise 
fc-dissimilar (as states in M). For a state q G Q' let Wq be the word such that 
\wq\ > min(m-ZeweZj\f (g), fc). Consider any two such words Wq and Wp. Then by 
Lemma [6] and Wp cannot lead to the same state in M'. Hence the size of M' 
is at least \Q'\, which is exactly the size of N. □ 

Corollary 3 (of Theorem [1]) . Each maximal (with respect to the inclusion) 
set Q' of pairwise k-dissimilar states of a DFA M is of size of the k-minimal 
DFA for M. 

Proof. First note that without loss of generality we may assume that 

p G Q' , q ^ Q' and p^uq implies in-levelM{p) > in-levelM{q)- (5) 

If not, then we can replace p by g in Q', without loosing the assumed property 
of Q' . After finitely many such substitutions, Q' satisfying ^ is obtained. 

Run fc-MiNiMiSE for M , and whenever there are two states p^kq considered, 
merge the one outside Q' to the one in Q' (do arbitrarily, if none is in Q'). Since 
Q' is maximal with respect to the inclusion, fc-MiNiMiSE terminates with the 
DFA with Q' as the set of states. □ 

Now we are able to establish a structural characterisation of fc-similar DFAs, 
analogous to characterisation of hyper-equivalent DFA's [31 Sect. 3.2]. In partic- 
ular, we derive the analogue of [Sj Theorem 3.8] for fc-similar DFAs. 

Corollary 4 (of Lemma [6] and Corollary [3|) . Let S C Q be a maximal set of 
pairwise k-dissimilar states of M , and let N be a k-minimal DFA for M . Then 
there exists a bisection h: S ^ P such that 
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— q ^ h{q) for every q Cz S , and 

— q = h[q) for every q Cz S such that in-levelMil) > k. 

Proof (of Corollary^. Wc have l^*! = \P\ by Corollary |3l For every q & S, let 
Wq € S^^{q) be such that \wq \ = in-levelM[q)- We define the mapping h: S ^ P 
by h[q) = fj-iuug) for every q £ S. Since M N, which yields M ^ N, we 
have q = S{wq) ^ fJ-iwq) = h{q). Finally, suppose that in-levelMiq) > k. Then 
LAiiq) = Ln[{5{wq)) = LN{fi{wq)) ~ L]y{h{q)) because M which yields 

q = h{q). □ 

B.2 Additional material for Section 13.21 

We refer to the distance tree we construct for the DFA M using the notation 
T>{M). We identify the leaves with the states of the DFA if this raises no con- 
fusion. To simplify the argument, we assume that _L is always in the 'D{M). We 
refer to a tree in a distance forest by a name of a distance tree. The vertices that 
are present in the compressed representation are called explicit, while those that 
were removed are called implicit. The standard terms father i(y) of a vertex v 
and ancestor always refer to implicit vertices. 



Algorithm 2 Distance-Tree 
1: for p £ Q do 

2: state(p) ^ p, level(p) <— 0, activate p 
3: for ^ = 1 to |n| do 

4: group active nodes according to ((5((state(ii), a))aei:) 
5: for eacli group of nodes V' such that \V'\ > 1 do 



6: choose v € V 

7: create active node v' , level(t;') — £, state(i'') = state(i;) 

8: for v" e V do 
9: join v" to v' , deactivate v" 

10: replace state(i'") in entries of 5 by state(i'') 



Lemma 9. For a minimised DFA M if Distance-Tree replaced the state q 
in 6 by p at phase (. then d{p, q) = £. 

Proof. The proof proceeds by induction on £. If £ = 1 then q and p have the 
same successors. Since q and p are not equivalent, their distance is exactly 1, as 
claimed. 

Suppose that q was replaced by p in phase £ > 1 . Since q was not replaced by 
p in phase £ — 1, by the induction assumption d{q,p) > d — I. Let (gi, . . . , q\s\) 
be the set of successors of q in the DFA and let {q'l,. . . ,q'\^\) be the vector of 
its successors in the representation in Distance-Tree. Since qi was replaced by 
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q[ in phase — 1 or earlier, d{qi,q[) < i — 1. Similarly, d{pi,ql) < £ — 1. Then, 
by O, 

d{p, q) = 1 + ma.xd{qi,pi) 

i—l 

< 1 + max(niax(d((7j,(j-),d((j-,pi))) 

i—l 

< £■ 

Consider now any other state r such that r was replaced by q in earlier phases. 
Then d{r, q) < i ~ I and thus d{r,p) ^ £. □ 

The bottleneck of Distance-Tree is the replacing of occurrences of g in 5 
by sonic other state p. We show that such replacing can be done in a way so 
that a single entry in 5 is modified at most log n times. 

Lemma 10. Distance-Tree can be implemented so that it alters every value 
of S at most log n times. 

Proof. The key modification needed in Distance-Tree is the choice of node 
V G v. For each state we introduce a counter c{p), initially set to 1, which 
keeps the track of how many states p represents. When we choose a node v G V 
we take the one with the largest c(state(w)). We update the value accordingly 
c(statc(u')) ^ Ev"&v' c(state(z;")). 

Note, that if we replace a value q by p in S, then c{p) > c{q) before the 
update of c and so c(p) > 2c{q) after the change. Thus if we replace the entry in 
6, the corresponding value of c at least doubles. Since c{p) is upper-bounded by 
n, each entry is replaced at most log n times. □ 

Proof (of Theorem\^. To allow fast replacing of entries q in the 5, for each state 
q used as the label in one of the dictionaries, we store an up-to-date list of its 
occurrences in all vectors in all dictionaries. 

When state p is merged into q we update the tries: we use the up-to date 
list of occurrences. For each occurrence of p in an internal node v we have the 
following situations: if v does not have a child labelled by q then we remove p 
from the linear dictionary, and insert q into it, pointing at the same child as p 
used to. If V has both children p and g, we have to merge their corresponding 
subtrees, rooted at vi and V2. We choose one of them, say vi, and insert each 
child of vi into subtrie of V2. Then we set pointer from qio V2. This might result 
in yet another situation of the same type, we do so recursively until we get to 
the leaves. 

The total cost of the case, when we did not need to merge linear dictionaries 
can be bounded similarly as in Lemma[TUl note, that after inserting q and deleting 
p from the linear dictionary, c{q) > 2c{p). Thus each such element is modified 
at most logn times and so the total cost is 0{\S\nlogn). 

If we do merge the linear dictionaries, we make a different analysis. For each 
linear dictionary we keep a counter, which calculates how many vertices were 
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inserted into this linear dictionary. When we merge two linear dictionaries, we 
remove the one with the smaller value of the counter and insert all its elements 
into the other dictionary. Then we sum the counters and update the remanding 
counter. 

Since each time a vertex is reinserted, the value of the counter in its linear 
dictionary at least doubles, and the maximal value of such counter is n, each 
vertex is inserted into a dictionary at most log n times. 

If we are to merge two leaves, we simply join their respective lists and remove 
one of the leaves. □ 

Now we can use the distance forest to our benefit. Instead of finding pairs 
of states that are fc-similar we proceed in another fashion: roughly speaking, for 
each state q we want to find the closest (with respect to d) state p satisfying 
in-level{p) > in-level{q), then, using this state, we want to judge, whether q 
is ever going to be merged to other state. To this end, we label the nodes of 
the I?(Af ) by states of the DFA M, formally we define state(?;) for each node v 
of T>{M). Since leaves of the T>{M) are identified with leaves, we obviously set 
state(g) = q for each such leaf. Then we label each inner node with one of the 
labels of its children, choosing the one with the maximal in-level m- 

For each state q let its submit node be the first node on the path from leaf q 
to the root labelled with a state different than q and let the submit state be the 
label of this node; let d{q) be the depth of the submit node q, if q is the label 
of the root, then d{q) = oo. Furthermore, define valiic{q) — in-level]\i{q) + d{q). 
The next lemma shows that value (q) can be used to approximate ^k- 

Lemma 11. Ifvahic{q) < k then q is k-similar to every state appearing as the 
label on the path from its submit state to the root. 
// value(p), value((7) > k then p Q- 

Proof. Suppose that value(g) < k and let q' be any node label above q's submit 
vertex (inclusively). Then 

min{in-level{q) , in-level{q')) + level(lca(g, q')) = in-level{q) + leve\(\ca.{q, q')) 

< in-level{q) + d{q) 
~ value(q) 
<k , 

and so q q' ■ 

Let value(g), value(g') > k, without loss of generality we may assume that 
in-level{q) < in-level{q') and q is not q'^s submit node (which could happen if 
in-level{q) = in-level{q') . Then d[q,q') ~ level(lca((7, g')) > d{q). Thus 

d{,q, q') + min{in-level{q) , in-level{q')) > d{q) + in-level{q) 

= value (g) 
> k , 



which concludes the proof. 



□ 
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For each state q let its /c-anccstor node v be the first node on the path from 
q to the root such that vahie(state(w)) > fc, and let fc-ancestor state be the label 
of V. 

Corollary 5 (of Lemma llip . The DFA obtained by merging each state q to 
its k-ancestor state is k-minimal and k-similar to M . 

The following theorem states easy consequences of Corollary [5] 

Proof (of Theorem\^. Calculate the labels in the 'D{M) as described earlier, 
this can be done using one depth-first traversal. Then calculate value(g) for each 
state q, this as well can be done using one depth-first traversal. Sort the pairs 
(g, value((j)) according to value((j), since Yahic{q) < 2n, this can be done in linear 
time using CountingSort. Note, that the number of states of fc-minimal DFA 
equals \ {q \ valuc{q) > fc}|, by CoroUaryEl which can be now easily computed 
in linear time. 

By Corollary [5] to obtain the /c-minimal DFA it is enough to merge each q 
with Yahie(q) < k to its fc-ancestor. A table of ancestor assigning to state q 
its fc-ancestor can be computed in linear time. The merging can be performed 
in 0(1^1) time, as we are only interested in the transition of the states q' such 
that value(?') > fc. We look through 5 and replace each entry S{q',a) — q by 
ancestor (q). 

There is a little subtlety: when replacing 6{q',a) = q by S{q',a) = q" we 
should take care that q ^ J-, as otherwise it would be impossible to bound the 
running time by \5\. However, note that if q" ~fc _L then the language of q" is fi- 
nite and therefore there is a path from g" to _L, hence in-level{q") < in-level{J-). 
Note, that when in-level for two states are equal, we arbitrarily choose one, so 
without loss of generality it can be assumed that _L is never merged to any other 
state. 

We now present a Od^l log n) algorithm, which at step fc = 0, 1, . . . , n has in 
memory the fc-minimal DFA. As previously, in step fc it will keep only states with 
value at greater than fc: it merges each existing state q such that vahie{q) = fc into 
its fc-ancestor q' which by the construction satisfies value((7') > value((7) + 1 > fc. 
To obtain the proper running time, we need only need to organise the data 
structures properly. It is represented by a list of transition: for each state q we 
list the pairs (a, q') such that S{a, q) = q' , for all valid a. Moreover, each q has a 
list of incoming transition, i.e., list of pointers to the transitions to it. Moreover, 
each state q has a counter rank((7), which describes how many states were merged 
to it. 

Assume that we merge p to q and rank(p) < iank{q). Then the situa- 
tion is easy. We redirect each transition to p into q, and perform the update 
rank(g) •(— rank((7) -I- rank(p). When rank(p) > rank(q) then we redirect each 
transition to q into p, and replace the outgoing transition from p by outgoing 
transitions from q. Since they are given as a list, this is done in (1) time. Then 
we rename p as 5 and update rank raiik^q) ^ Tank{q) + rank(p). 



20 P. Gawrychowski, A. Jez, A. Maletti 



Note, that each time an entry in 6 is modified, the rank of the target state 
doubles. Thus each transition is modified at most logn times and so the running 
time is 0{\S\ log n). 

Note, that while the running time 0{\6\ logn), outputting the results for each 
k might take a time up to logri). □ 



B.3 Additional material for Section 13.31 

Lemma 12. The total cost of maintaining the transition by $ in the tries is 
0{\5\logn) linear dictionary operations. 

Proof. Because L{M) is finite, m(p) = max{|z/;| : w £ is defined for any 

state p, let us denote = {p | m{p) = i} and (5<oo = [^^Qi■ 

Consider a DFA M' built on states (5<oo and their direct predecessors, i.e., 
Q'<oo = W '■ 3a G 5{q,a) G Q<,oo} U {-L}. Take the 5' restricted to input 
from Q'^oc and values in (5<oo- For each state q with infinite right- language 
and transitions into states in (5<oo we insert into vectors of successors ($, g'), or 
($, _L), if g has only undefined transitions. By Theorem|4]the cost of construction 
of T' for Al' is 0(1^1 logn). Using T' for M' in phase d we merge states from 
M' which are at distance d — 1 or less: it is enough to sort the nodes of T' 
according to their level and in phase d merge leaves in subtree of each node 
V such that level(u) < d. To perform the merging efficiently, each state in M' 
is assigned rank, which denotes the number of states that it represents. When 
states gi , . . . , are to be merged, we choose the one with the maximal rank, say 
Qi and replace of occurrence of gi, . . . , g^ in the trie by g^. Then we update the 
rank: rank(gi) X)j=i rank(gj). Thus each such entry in the trie is replaced at 
most log n times: whenever it is replaced, the corresponding rank doubles and it 
is upper bounded by n. □ 

Proof (of Theorem\B\). In linear time we can identify states such that their right- 
language is finite. By Theorem 2] their distance tree can be built in 0{\5 \ logn) 
time. 

Grouping of states in Distance-Tree is a by-product of using trie for the 
vectors of successors. Each state is deleted from set Q once, each such deletion 
results in an update of the trie. 

The number of linear dictionary operations for the letters in the respective 
signature can be upper bounded as in Theorem [21 that is, by {\5\\ogn). By 
Lemma [T2] the same bound applies to the construction and usage of the distance 
tree for states with finite right-language, □ 

Next, we comment how the ancestors and nodes are represented for the algo- 
rithm, as some of them are implicit: they are represented by an explicit vertex 
directly below them with an offset, i.e., a pair {v^t}. 

Recall, that we iteratively construct fragments of the distance forest, built 
on states Qt = {p \ m{p) = t}, i.e., recognising words of length at most t. For 
each already constructed fragment we do the preprocessing allowing efficient 
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computing the lea for any pair of vertices. There are known construction for 
doing this in constant time [l] however, they use the power of the full RAM 
model. For our purposes, the simple construction that keeps at every node a 
list of ancestors 2°, 2^, . . . , 2'°^^" higher allow performing the search in 0{\ogn) 
time, which does not influence the total running time in our case. As shown 
later, having the preprocessing performed for each such a fragment separately, 
is enough to execute lea-queries for the whole tree. 

Firstly we argue that indeed adding fragments built on states from Qt is 
reasonable. Moreover, if for some states to(p) 7^ iTi{q), d{p,q) can be calculated 
easily. 

Lemma 13. If m{p) ^ m{q) then d{p,q) = max(m{p) , m{q)) + 1. 
If m(p) = m{q) then d{p,q) < max(m(p), m(q)) + 1. 

Recall, that the spine is the path joining the state _L, which recognises an 
empty language, with the root of the T>{M). Note, that the spine has no com- 
pressed fragments, as the distance between _L and a a state p is equal to to(p) -I- 1, 
by Lemma [Tni moreover, by ([T]) it is easy to see that {m{p) : p G Q<oo} is equal 
to {0, 1,2,..., max{p S Q<cxj : 'm-ip)}- Therefore the spine is created beforehand 
as an uncompressed line of length max{p G Q<oo : m{p) + 1}, which is at most 

|Q<oo|- 

Lemma 14. The distance between w — ((ai,pi))ig/^ and 'v' = ((ai,p9)ig/2 can 
be computed using 0{\li \ + I/2I) lea-queries. 

Proof. Wc calculate their distance straight from the definition. This can be done 
using -I- I/2I lea queries, by going through consecutive elements of these 
vectors: as they are sorted, seeing [ai,pi) and (ai',p^/) we can decide whether 
Oi = a^/, in which case i ~ i' G 1x012, or Oi > a^/, and thus i G /i \/2; or < a.^', 
when i' G /2 \ /i- In the first case, we calculate d{pi,p[,), this can be done by 
comparing m{pi) and rn{p^,) and by calculating lca(pi,p-,), if m{pi) = m{p[,). 
This distance is compared with the current maximum. If i G /i \ /2 , wc compare 
the current maximum with m{pi) + 1; the situation for p'^, is symmetric. 

There are at most U I/2I lea queries used and at most as much other 
operations, which arc all performed in constant time. So the cost of the whole 
procedure can be charged to the U I/2I lea queries. □ 

To show that the total time of the construction of the the fragment for Qt 
is {\5t \ logt), where 5t is the transition function restricted to the input from Qt, 
we estimate separately the cost of finding the ancestors of the vectors and the 
cost of grouping of the vectors, according to their 2^^ successors. However, to 
properly estimate the time needed for that, we cannot use the whole already 
constructed part of the distance tree, as it may be vary large comparing to Qt- 
Thus, as a preprocessing step, we extract out of the distance tree the distance 
tree induced by the states appearing in the vectors. Being more precise, we 
calculate the subtrees 7^ for a G S: it is a sub-distance tree for states 5{Qt,a). 

Lemma 15. Constructing Ta for a G S can be done in time {\dt \ log |Qt|)- 
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Proof. Firstly we calculate the states in S{Qt,a) for each a, in total time Odi^il). 
Let us a fix a letter a. We sort the leaves in Ta in time {\Ta\ log IQf |): we assume 
that they are in some arbitrary, but fixed, order in V{M). Let them be 51, (72, 
. . . , Qs- We build Ta by successively adding leaves. Suppose that a tree for gi, . . . , 
Qi has already been built. The right-most path (names of nodes and their levels) 
is kept as a list. To add Qi+i, we calculate the lca(gi, qi+i) and remove from the 
list all nodes with smaller level. If the last element of the list, call it v, has level 
greater than \ca{qi,qi+i) we create a new inner node v' in Ta, insert it into the 
right-most path as the last element, make the right-most son of v the only child 
of v' and v' the new right-most son of v. Next we make g^+i a right-most child 
of the last node in the list (which might be v or v'). 

Since each node is inserted and removed from the right-most path at most 
once, the total running time is 0{\Ta\)- Summing up over all a G i7, we obtain 
that the total construction time is 0(|(5t|). □ 

Lemma 16. The total time of finding ancestors of vectors is 0{\5t\\ogn). 

Proof. Finding the ancestors is implemented naively: for each state q in the 
vector we traverse 'D{M) up 2^^ steps up from q. 

Consider a pair (a,p) in one of the vectors v and one of the edges e in 7^ on 
the way from p to the spine. We show that e is traversed at most [logt] times 
when constructing the distance tree for Q*. 

Note, that the recursive calls are made for k = [logt], [logt] — 1, . . . , 1. So 
it is enough to show that e is traversed once for a fixed value of k. 

Consider a given instance of a recursive call. Then when the sub-recursive 
calls are made, e goes to exactly one of these sub-calls, except when one of 
the 2'^-aneestor of leaves lays on e. However, in such a case there is no need to 
traverse e by any sub-call from the lower group: the lower end of e is an ancestor 
of all vertices in this tree. So we can modify the algorithm a bit: as soon as some 
edge is to be traversed, we check if the root lies on this edge. If so, this edge 
is not traversed, as searched node is implicit and therefore represented by the 
lower end and an offset. And so e is traversed at most once for each k, which 
concludes the proof. □ 

Lemma 17. Given k, the total size of grouping vectors in the recursive calls for 
k is linear in their size plus an additional cost, which is 0{\dt\ log |Qf |) summed 
over all k 

Proof. We want to sort lexicographically vectors Vi , . . . , of integers in the 
range 1, 2, . . . , n + This can be done in a standard way (say, usinf Radix- 
Sort) in time 0{n + \U\ + X]j=i I'^iD- This is too much, as n and \S\ might 
be large compared to Wj\- Thus we can do the following. In each Ta we 

can replace each node by a number from range 1, . . . ,2\Qt\ ~ I. Then instead of 
sorting according to names in 2?(A/), we use the local names from Ta. In this 
way the running time is 0{\Qt \ + |^| + X]j=i 

Still, for small instances, |(5t|-t-|i^| can be substantially larger than 
To avoid this problem, we process all the recursive call for a fixed k in parallel. 
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Then the sorting is done for all vectors in the recursive calls. Note, that if vec- 
tors come from different recursive calls, they cannot have the same non-trivial 
ancestors (and trivial, i.e., empty, ones can be identified and removed from the 
sorting beforehand). Thus the additional cost 0(|(3i| + is included once for 
each fc = 1, . . . , log \Qt\, and so in total gives 0{{\Qt\ + log |Qf |) time, which 



Two previous lemmata allow calculating the whole recursion time 

Lemma 18. The cost of the procedure for vectors vi, . . . , V£, excluding the ad- 
ditional cost o/ Od^tl log IQtj) from Lemma [77| is i{J2i=iWi\ ~ |v|) log |(5t|), 
where v is a lowest common ancestor o/ Vi , . . . , . 

Proof. First note, that if v is the lowest ancestor of vi, . . . , and v' is some 
ancestor of these vectors, then |v| > |v'|, and so the estimation using |v'| is 
weaker than the one using |v|. 

The claim is shown by an induction. Fix a constant c, for which it is shown 
that the total cost is at most c(^^^-^ |vi| — |v|) logs. 

The basis of the induction are calls for at most two vectors. No recursive call 
is made for one vector, and so we do not consider it. As observed in Lemma [HI 
when there are only two vectors v,v', the calculation is done using (v + v') lea 
queries. So c can be chosen in advance so that this is at most c(v + v') logt. 

When there are more than two vectors, b^ Lemma [TTl we can group them ac- 
cording to their ancestors in time at most X]i=i Since |v| < min^^j^ |vi|, con- 
stant c can be chosen in advance in the way that this is at most c(^^^-^ |vi| — |v|) 
time. 

Then there are sub-calls made. Let Ui, . . . , u^/ be the ancestors of vectors. 
Then the 'upper' recursive call, by the induction assumption, takes at most 
c(log s — — v) (note, that v is a common ancestor of Ui, . . . , u^/). The 

'lower' subcalls take, in total, time c(J2i=i Wi\^ Ei=i I "il) (logs — 1), since each 
Ui is a common ancestor of vectors for one of these subcalls. Hence all the calls 
take at most c{J2i=i Wi\ ~ |v|)(logs — 1) time. 

Summing the cost of the recursive calls and the grouping: 



isO(|(5t|log|Qt|). 



□ 




c(E l^il - M) + <11 l^il - |v|)(log5 - 1) < c(^ |vi| - |v|) logs. 



i—1 i—1 i—1 



as claimed. 



□ 



Proof (of Theorem^. By Lemma [TTl together with Lemma [T51 



C Additional material for Section [4] 



The DFA c(M) has 14 states, and it can easily be verified that it is hyper- 
equivalent to M and has no different, but equivalent kernel states. It is depicted 
in Figure [HI 
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e' 




Fig. 6. DFA c(M) constructed in Section 2] where a-transitions are represented by 
unbroken lines (unless noted otlierwise), 6-transitions are represented by dashed lines, 
and e — {vi,V2} and e' = {113,^4} with vi < V2 < W3 < U4 and c(wi) = 1, c{v2) = 3, 
c{vs,) = 1, and 0(174) = 2. The state _L is not depicted. 

Proof (of LemmaU^. Let N = {P, fj.,pQ, F') be a DFA such that M N 
and \P\ < 14. Without loss of generality we may assume that there are no 
different, but equivalent kernel states in N. Note that the DFA M is mini- 
mal provided that G has no vertex without any incident edge. By [31 Theo- 
rem 3.8]1 there exists a mapping h: Q ^ P such that q ~ h(q) for every q € Q, 
which additionally is an isomorphism h : Ker(7\jf ) — !• Ker(A^) between the kernel 
states. Since Ker(M) = {±,00,©,©} U {Oj \ O G {©,©}, j G [3]}, we have 
{00, ©} C F' and 10 distinct states in N that behave like their counterparts 
in M [i.e., LM{q) = LN{h{q)) for every q G Ker(M)]. Since T 7^ p and 5{v) ^ p 
for every p G Kcr(A^) and u G we conclude that h{~Y) ~ po ^ Ker(A^) and 
h{d{v)) ^ Ker(iV), which means that we identified the ll"' and 12'^ state in N 
because they are not hyper-equivalent to each other (i.e., T S{v) for every 
V € V). Consequently, there are at most 2 other unidentified states. 

Claim 1. The DFA N commits at least \E\ ■ {\V\ — 2) errors with prefix ve such 
that V G V is a vertex and e G i? is a non-incident edge. Exactly \E\ ■ {\V\ — 2) 
such errors are committed if /^(we) G {/i(©i), /i(©i)} for every u G V and e € E 
with V ^ e. 

Proof. Let u G be a vertex and e G -B be a non-incident edge, which means 
that V ^ e. Clearly, S{vea^^^) = 9j for all j G [3] and 7^ ®j for all dif- 
ferent i,j G [3]. Consequently, also the 3 states of 5 = {^{vea^~^) \ j G [3]} 
are pairwise hyper-inequivalent (and hence different). Since there are at most 
2 unidentified states, at least one state of S is already identified. Moreover, 

^ Actually, the cited theorem assumes to be minimal, but the original proof also 
works in our relaxed setting (i.e., when there are no different, but equivalent states 
in the kernel). 
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S{vea^-^) - n{vea^-^) for all j G [3] yields that ^j,{vea^-^) £ {h{©j),h{©.j)} 
for at least one j G [3], since ©j and ©j are the only states hyper-equivalent 
to ©j and /i(uea-'~^) ~ ©j. Since is a congruence and h is an isomorphism 
on Ker(Af), we obtain that ^{vea^) G {ft,(©3), /i(©3)} in any case. However, 
d{vea'^) = ®3, so we obtain the error word vea^x for some x £ {a, 6} because 

- LN{h{©^)) = La/(©3) and LN{h{©^)) = La/(©3), 

- iAf(03) ALw(/i(©3)) = LAf(03) ALm(©3) = {&}, and 

- iAf(03) ALw(/i(©3)) = Lm(03) ALAf(©3) = {a}. 

Moreover, if /i(we) G {/i(©i), /7,(©i)}, then there is exactly one error word 
with prefix we. Overall, this yields at least one error for every v G V and e E E 
with V ^ e. The total number of such errors is at least \E\ ■ {\V\ — 2), and it is 
exactly l^;] • {\V\ - 2) if fjL{ve) G {/i(©i), h{©i)} for every w G F and e G with 

V ^ e. □ 

Let S = {/^(v) I V G V^}. For every v E V, the state d{v) is not hyper- 
equivalent to any state of Ker(A^) and 6{v) / po, which yields that \S\ < 3; 
without loosing generality we can assume that l^j = 3, as this is the hardest 
case. Let 5* = {^1,^2,^3}, and let c: V [3] be such that for every t; G we 
have c{v) = i if and only if fj.(v) = pi. Thus, we deduced a 3-colouring from the 
transitions of A'^. Next, we investigate colouring violations and its connection 
with the number of errors with respect to M. For every edge e = {vi, V2} E E 
such that c(i'i) = c{v2), we have Lj^^jiivie)) = Ljv(/i(w2e)), but 

LM{5{vie)) ALM{5{v2e)) = La/(©i) A La/(©i) = {aaa.aab} . 

Consequently, wc obtain at least 2 additional errors for every such edge e because 
the corresponding error words start with ve where v E e. Those two errors 
together with the errors described in Claim [1] yield that N commits strictly 
more than \E\ ■ {\V\ — 2) errors if G is not 3-colourable. 

Finally, we show that the DFA c(M) = {P, S, iJ,,T,F), which has exactly 
14 states, commits exactly \E\ • (| V| 2) errors, if G is 3-colourable by the proper 
3-colouring c: V ^ [3]. By Claim[Tl c{M) commits exactly \E\ ■ {\V\ ~ 2) errors 
with prefix ve for v € V and e E E with v ^ e because n{ve) G {©i,©i} for 
all such V and e. Clearly, all remaining error words must start with ve for some 

V € V and e E E such that v G e. However, if ?j G e, then 5{ve) = fi(c(v)e) by 
the definition of c(il/), which yields that no error word with prefix ve exists. □ 

D Additional material for Section [5] 

Recall that M — {Q, S,S,qo, F) is our DFA and that we can always rename 
states to avoid conflicts. In particular, we assume that Q n N = 0. 

Lemma 19 (full version of Lemma [2])- For every congruence c:i C Q x Q 
on M , there exists a DFA N = (P, A, /i, qo, F') such that 

(i) LM{q) = LN{q) n S* and <5~^(g) = ^^H?) for every q €Q, 
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(n) iA/(qi) ALm('72) = LN{qi) ALN{q2) for all qi ~ q2, 
(in) pi p2 for every pi £ P \ Q and P2 ^ P with pi P2, and 
(iv) qi 7^ q2 in N for all qi q2- 

Proof. Let n be the index of c± and 9: ((5/~) [n] be an arbitrary bijection. 
Moreover, let 

— P ^QU[n] and F' ^ FU {n}, 

— A ^ U U {t,x} where t,x ^ S are different, new letters, 

— fi{q, a) = S{q, a) for every q £ Q and a £ 

— fjL{q,t) = 0{[q]^) for every q £ Q, and 

— fi{i, x) ~ i + 1 for every i £ [n — 1] and fJ-{n, x) = 1. 

Next, we discuss hyper-equivalence and fc-similarity in the DFA N. Clearly, i 9^ j 
for all different i,j £ [n]. Since ^ is a congruence, we obtain that gi 7^ 92 for all 
qi,q2 & Q such that qi 9^ ^2 -Moreover, i ^ q ioi every i £ [n] and q £ Q. The (i) 
is obvious. Finally, we can prove by an induction on the length of w G A* that 
w £ LM^qi) ALm(?2) if and only if w S Lpf{qi) ALjv(?2) for all qi ~ q2- □ 

In the future, we will use the construction in the proof of Lemma [19] as 
a gadget with n states and simply assume that we can enforce that qi 7^ q2 for 
every qi 9^ q2 and every congruence ~ with index n. This can be done since none 
of the newly added states is fc-similar to an existing state and all newly added 
states are pairwise dissimilar. Moreover, since we will only merge fc-similar states 
and fJ.{qi, t) = ^{q2, t) for all qi q2, the gadget does not introduce new errors. 

Lemma 20 (full version of Lemma [3]) . For every subset S C Q \ {qq} of 
states and mapping min-level: 5 — > there is a DFA N — {QUI, SUA, fi, qo, F) 
such that 

— Lmiq) = LN{q) for every q £Q\ {go}, 

— \^^^{i)\ = 1 for every i £ I, 

~ \6~^{s) A^~^{s)\ = 1 for s £ S with min-level(s) > 2 and 6~^{q) = ^~^{q) 
for all remaining q £ Q, and 

— in-levelpf{s) > min-level(s) for every s £ S. 

Proof. Let n = max {min-level(.s) | s e 5*} be the maximal requested level. We 
construct the DFA N such that 

— I = [n] (supposing that Q n N = 0), 

— A = {d} U {dg lags'} are new, different letters, 

— a) = S{q, a) for every q £ Q and a £ S, and 

— n{qo,d) = 1 and d) = i + 1 for every i £ [n — 1], 

— /i(min-level(s) — l,ds) ~ s for every s £ S such that min-level(s) > 2. 

Clearly, LM{q) = LN{q) for every q £ Q \ {qo} and in-level pf{s) > min-level(s) 
for every s £ S. Finally, ^~^{i) ~ {d*} for every i £ [n], which can be used to 
prove the remaining statements. □ 
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Wc can use the first gadget to make sure that all newly introduced states in 
the previous construction are /c-dissimilar. Mind that a renaming can be used to 
ensure that Q n N = 0. 

We use the gadgets of Lemmata [12] and [501 as follows: We select the states 
S = {lo, 2o, 3o, li, 2i, ©0, ©} and let min-level: S* ^ N be such that 

min-level(lo) = min-level(2o) — min-level(3o) = 3s — 1 
min-level(©o) = min-level(©) = min-level(li) = min-level(2i) = fc + 1 . 

Moreover, let ~ C Q x Q be the equivalence induced by the partition (single- 
element classes are omitted) 

{±,©,©}U{©, \ 0<i<s}U{ij\ie [2], J e [£]} {v\ve U {lo,2o,3o} . 

It can easily be checked that ~ is a congruence. Let M = {Q, U, S, 0, F) be the 
DFA obtained after adding the gadgets of Lemmata [12] and HD] using the above 
parameters. The obtained DFA M is illustrated in Fig. [5j We state some simple 
properties that follow immediately from the gadgets. 

Lemma 21. We observe the following simple properties: 

— Hyper-inequivalence is as indicated in Fig. 0' i.e., qi 7^ q2 for every qi 9^ (72. 

— The levels are as indicated in Fig. O More specifically, for j G [£] , z G [s — 1] , 
and V £ V . 

in-levelM{^o) = in-levelMi^a) = 3s — 1 in-levelM{-L) = cxd 
in-levelM{-^o) = 3s — 1 

in-levelMi^j) = in-levelM{2j) = k + j in-levelM{v) = s + 1 

in-levelM{®i) ~ k + i + 1 in-levelM{® s) = k + l+l 

in-levelM{9) ~ 3s in-leveluii) = « 

in-level i\ J {©) ~ k + 1 in-level i\i{s) = s 

Proof. Both properties follow immediately from Lemmata 1191 and 1201 and trivial 
inductions. □ 

Lemma 22. 

(i) The difference between the right-languages of the states {lo,2o,3o} is small. 
More precisely, 

LA/(lo)ALM(2o) = {a6^-i} 
LMilo)^LMi3o) = {ab'-\ab'} 
LA/(2o)ALM(3o) = {a6^} . 

(ii) Additionally, \S~^(v)\ = 2^* for every v e V, and 

\Lm{©,) a Lm{©)\ = 2'-' d{©,, ©)^s-i + l 
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\LM{©^)ALM{9)\>2'-^ d{©,,9) = s + l 

\Lm{9)ALm{©)\=2'-' d(©,©) = .s + l 

\Lm{Ij)ALm{9)\ > |Lm(©)| =2''-i d(l,,0) =max(s,£-j + l) + l 

|Lm(2,)ALm(«)| > |£m(©)| =2-'-^ d(2j,©) =niax(s,^-j- + l) + l 

for every < i < s and j G [£] . 
(Hi) Finally, 'is the reflexive and symmetric closure of 

= u{(io,«) 1 1 e [3],v e V}u{ivi,v2) I vi,v2 eV}Li 
U {(©, ©,) \ 0<t<s}U {(©, ©), (©, ±)} U {(©, t,) I I e [2],s + i<j<e}. 

Proof. Properties (i) and (ii) can be observed easily. We turn to (iii): let i,i' € [3] 
such that i ^ i' . Since in-level M{io) = in-level Mi^o) = 3s — 1 by Lemma [211 and 
dihji'i) > ^ — 1 by (i), we obtain d(zo, ig) + 3s > (fc — 2s) — 1 + 3s = fc + s — 1 > fc, 
which proves that io *o- Similarly, we can compute d(iQ,v) + s + 1 < 
(fc — 2s + 1) + s + 2 = fc — s + 3 < fc for every v ^ V, which proves io v. Now, 
let vi,V2 e V. Then 

in-level m{vi) = in-level m{v2) = s + 1 and d{vi, U2) < s + 2 , 

which yields 2s + 3 < k and proves that vi ^k ^2- Since in-level{9) = 3s 
and d{©Q, ©) = s + 1, we obtain 3s + s + 1 = 4s + 1 < fc, which proves that 
©0 ®- In essentially the same way, we can prove all similarities to ®, which 
yields that we proved all similarities. Clearly, two states qi,q2 G Q such that 
miii(in-levelM ill) , in-level m {12)) ^ k are fc-similar if and only if qi = q2, which 
proves the nontrivial dissimilarities. □ 

Using a proper 3-colouring c: y — [3] we define the DFA c(m). 

Definition 7. Let c: V ^ [3] be a 3-colouring and c{M) = (P, 17, /i, 0, F) he the 
DFA such that 

- P = {_L} U {ij I i G [2], j e [(]] U [0, s] U {©, I < i < s} 

— for every v € V , e = {vi,V2} G E with v ^ e and ui < W2, « G [s], j G [i], 
and j' G [3] 

^i{i-l,a) = i ^(©i_i,a) = ©j a) = Ij 

H{i-l,b)^i n{©i^i,b) = ©i /^(2j_i,a) = 2j 

^(s,w) = c(w)o n{li,a)^©s /i(2f,a) = ©s 

, ■ifc{v2) # j' 
, otherwise. 



Ai(.7o.e) = 



— For all remaining cases, we set fi{q,a) = _L. 

The DFA c(M) is illustrated in Figure[71 We first show that c{M) is fc-similar 
to M and then that it is fc-minimal. 
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Lemma 23. The constructed DFA c{M) = {P, ^,0, F) is k-similar to M. 

Proof. Let A be the alphabet of letters without the letters used by the gadgets, 
which are collected in P. We first observe the equalities 



Ln{©s) 


= Lm{©s) 


LN{©s)riA* 




Ln{±) 


= Lm{^) = Lm{©) 


LNi±)nA* 


= 




= Lm{1j) 


LAr(lj) n A* 


^ {b'-\b'-^+^} 




= Lm{2,) 


LN{2j)r\A* 






= Lm{©,) 


LAr(©,) n A* 


-{a, 5}— 




= 5-\^) 







In addition, since {ij \ i E [2],j G [£]} forms a single class of ~ in both 
c(M) and M it holds that 

Le(M)(ij) n A*ps* = Luiiy) n A*rs* 

for all € [2] and j,j' G [£]. From those statements and simple applications of 
the statements of Lemmas [211 and [22l we can easily conclude that M and c(M) 
are fc-similar. □ 




Note, that all states in c(M) are pairwise fc-dissimilar by Lemma Con- 
sequently, they form a maximal set of pairwise fc-dissimilar states in Af , which 
yields that |P| coincides with the number of states of all fc- minimal DFA for M 
by Lemma [3l Thus DFA AI is fc-minimal. 

We finally show that our construction is correct. 

Proof (of Lemma^. Let N = {P, S ,po, fi, F') be a DFA that is fc-minimal 
for M. We select a maximal set Q' of pairwise fc-dissimilar states by 



Q' - {±} u {i, 1 1 e [2], J e [£]} u [0, s] u {©, | < i < s} 
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Let h: Q' P he the bijection of Corollary |4] using the maximal set Q'. Let 

5 = {±} U {©, I < i < s} U {ij I i G [2],j 6 [£]} . 

Consequently, q = h{q) for every q E S. From M N, which yields M ^ N, 
we can conclude that S{w) fJ-iw) for every w G S* . Let w G {a, by for some 
< J < s. Then = because /^(w) ~ = i and h{i) is the only 

state p E P such that p ^ i. Now, let w = uv with u G {a, fe}'* and w G Since 
d{uv) = V and v ^ ig only for i G [3], we obtain ^(uv) G {/i(lo), /i(2o), /i(3o)}. 
From this behaviour we deduce a colouring c : y ^ [3] by c(d) = i if and only if 
fj,{uv) = h{io) for every v £ V . Note that this definition does not depend on the 
choice of u because fi{u) = /i(u') = h{s) for all u, u' G {a, 6}''. 

Claim 2. The DFA iV commits at least 

m = 2^'-^ ■ \E\ ■ {\V\ - 2) + 3 • 2*-^ • \E\ 

errors with prefix 

— uve where u G {a, b}* , u G V is a vertex, and e £ E such that v ^ e, or 

— dwe where d is the main symbol of the level gadget, w G S* ^ and e £ E. 

Exactly m such errors are committed if the following two conditions, called (f), 
are fulfilled: 

(i) ii{uve) G {h{©i),h{±)} for every u G {a, b}*, v eV and e <E E with v ^ e. 

(ii) ii{dwe) G {/i(©i), for every w £ E* and e G -E. 

Proof. We distinguish two cases: whether the error word goes through lo for 
some i G [3] or it goes through v for some v 

— The only way to arrive at in is via the level gadget. Then by Lemma [201 
there exists exactly one w G 5~^(jo) such that \w\ = 3s — 1 and w docs not 
end with v G V. In addition, (5(io,e) = ® for every e G E. However, in N 
we have ^(lu) G {/i(lo), h{2o), h{3o)} and fi{h{jo), e) G for every j G [3] 
because these are the only hyper-equivalent states. Since q = h{q) for every 
q G 5, we have 

|Lw(/i((Z))ALa/(«)| > 2-^-1 
\LN{h{l.))ALM{9)\ - |Lw(/^(©o))ALm(«)| =2-^-1 

for every g G S* by Lemma [221 11). This yields at least 2^~^ error words 
that start with we, and exactly 2^~^ such error words under condition (f). 
Consequently, there are at least 3 ■ 2''"-'^ ■ \E\ such error words in total, and 
exactly that many under condition (f ), because the level gadget is reproduced 
exactly. 
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— Next, we consider a word w ~ uve with u e {a,b}'^, v ^ V, and e £ E 
such that V ^ e. Clearly, S{w) — ©, but fJ,{w) G h{S) because h{S) are the 
only states of P that are hyper-equivalent to ©. Since q = h{q) for every 
q G S and = h{s) independently of u, there are 2'' such words u 
and \Lm{9) ALM{h{q))\ > 2"^^ for every q e S by Lemma [SSfii) be- 
cause Liq{h{q)) = Lm('?)- The same lemma also allows us to conclude 
that \Lm{9) l\LM{h{q))\ = 2'^^^ under condition (j"). Overall, there are 
2s . 2^5-1 . |£;| . _ 2) such error words, and exactly that many under 
condition (f). 

This proves the claim. □ 

Now we investigate the number of errors introduced for a colouring vio- 
lation in the 3-colouring c. Let e = {vi,V2} S be a violating edge, i.e., 
c{vi) = c{v2)- Then we have ^(uvie) = ^{uv2e) € h{S) for every u S {0,6}**. 
However, {S(uvie),6{uv2e)} ~ {©0,©}, which yields at least 2* ■ 2^ errors by 
Lemma [22111). Since s > logjdT^I) + 2, we have that 2'^" > 2"+^ ■ \V\, which, 
together with Claim [2] proves our statement if the graph is not 3-colourable. 

Finally, let us consider the errors of c(M) = (P, 17,^, 0,F) provided that 
c : y — > [3] is a proper 3-colouring for G. Recall the properties mentioned in the 
proof of Lemma [23l Since c(Ajf) fulfils property (|), we already identified exactly 
the errors of Claim[2] Clearly, all error words pass a state h{io) in N with i e [3], 
which yields that we only have to consider error words with prefix uve or uva 
where u € {a, b}*, v <E V, and e G E such that v G e. 

— We start with the prefix uva. As already observed we have that /i~^(s) = 
{0,6}" = 6^^{s). Moreover, fJ.{s,v) = io for all v G c^^(i), whereas we have 
5{s,v) = V. Thus, we consider Lj\f (w) A Lc(a/)(*o)- Since S{v,a) = li and 
/Lt(io, a) = ii, we obtain the potential errors 

{uvaV \je{e-l, £}, u G {a, &}"} . 

Consequently, we have 2*+^ ■ \V\ potential errors, all of which are of length 
at most s + 2 + £ = k- s + 2<k. 

— Finally, we have to consider the prefixes uve. If u G e, then ^(io, e) = S{v, e) 
by the construction of c{M) and so no errors are introduced. 

Summing up all identified error words, we obtain that the DFA c(M) commits 
at most 

2^"-^ ■ \E\ ■ {\V\ - 2) + 3 ■ 2'^^ ■ \E\ + 2-"+^ ■ \V\ 



errors, all of which are of length smaller than k, which also proves that the 
DFA c{M) is fc-similar to M. Moreover, since all of its states are pairwise k- 
dissimilar, it is also fc-minimal. □ 



